Speaker recognition using a trajectory-based segmental HMM

نویسندگان

Ying Liu

Martin J. Russell

Michael J. Carey

چکیده

A segmental HMM is a HMM whose states are associated with sequences of acoustic feature vectors (or segments), rather than individual vectors. By treating segments as homogeneous units it is possible, for example, to develop better models of speech dynamics. This paper begins by describing a type of segmental HMM in which the relationship between the state and acoustic level descriptions of a speech signal is regulated by an intermediate, articulatory layer, and discusses its potential benefits for speaker recognition. As a first step towards applying this type of model to speaker recognition, text-dependent speaker verification results obtained on YOHO using a simpler segmental HMM are presented, which show a 44% reduction in false acceptances using the segmental model compared with a conventional HMM. Experiments in text-independent speaker verification on Switchboard are then described.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker adaptation of trajectory HMMs using feature-space MLLR

Recently, a trajectory model, derived from the hidden Markov model (HMM) by imposing explicit relationships between static and dynamic features, has been proposed. The derived model, named trajectory HMM, can alleviate two limitations of the HMM: constant statistics within a state and conditional independence assumption of state output probabilities. In the present paper, a speaker adaptation a...

متن کامل

Speech recognition using non-linear trajectories in a formant-based articulatory layer of a multiple-level segmental HMM

This paper describes how non-linear formant trajectories, based on ‘trajectory HMM’ proposed by Tokuda et al., can be exploited under the framework of multiple-level segmental HMMs. In the resultant model, named a non-linear/linear multiple-level segmental HMM, speech dynamics are modeled as non-linear smooth trajectories in the formant-based intermediate layer. These formant trajectories are m...

متن کامل

Elimination of trajectory folding phenomenon: HMM, trajectory mixture HMM and mixture stochastic trajectory model

In this paper, a study of topology of Hidden Markov Model (HMM) used in speech recognition is addressed. Our main contribution is the introduction of the notion of trajectory folding phenomenon of HMM. In complex phonetic contexts and in speaker-variability, this phenomenon degrades the discriminability of HMM. The goal of this paper is to give some explanation and experimental evidence suggest...

متن کامل

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

A Recognition Method Using Synthesis-based Scoring That Incorporates Direct Relations between Static and Dynamic Feature Vector Time Series

It is well known that hidden Markov models (HMMs) can only exploit the time-dependence in the speech process in a limited way. Parametric trajectory models have been proposed to exploit this time-dependency. However, parametric trajectory modeling methods are unable to take advantage of efficient HMM training and recognition methods. This paper describes a new speech recognition technique that ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2004

Speaker recognition using a trajectory-based segmental HMM

نویسندگان

چکیده

منابع مشابه

Speaker adaptation of trajectory HMMs using feature-space MLLR

Speech recognition using non-linear trajectories in a formant-based articulatory layer of a multiple-level segmental HMM

Elimination of trajectory folding phenomenon: HMM, trajectory mixture HMM and mixture stochastic trajectory model

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

A Recognition Method Using Synthesis-based Scoring That Incorporates Direct Relations between Static and Dynamic Feature Vector Time Series

عنوان ژورنال:

اشتراک گذاری